Serveur d'exploration autour du libre accès en Belgique

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Mixing Statistical and Symbolic Approaches for Chemical Names Recognition

Identifieur interne : 000147 ( France/Analysis ); précédent : 000146; suivant : 000148

Mixing Statistical and Symbolic Approaches for Chemical Names Recognition

Auteurs : Florian Boudin [France] ; Manuel Torres-Moreno [France, Canada] ; Marc El-Bèze [France]

Source :

RBID : ISTEX:8511A82D0D8C02A599403A36D5B8D7894ECE64D3

Abstract

Abstract: This paper investigates the problem of automatic chemical Term Recognition (TR) and proposes to tackle the problem by fusing Symbolic and statistical techniques. Unlike other solutions described in the literature, which only use complex and costly human made ruled-based matching algorithms, we show that the combination of a seven rules matching algorithm and a naïve Bayes classifier achieves high performances. Through experiments performed on different kind of available Organic Chemistry texts, we show that our hybrid approach is also consistent across different data sets.

Url:
DOI: 10.1007/978-3-540-78135-6_28


Affiliations:


Links toward previous steps (curation, corpus...)


Links to Exploration step

ISTEX:8511A82D0D8C02A599403A36D5B8D7894ECE64D3

Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Mixing Statistical and Symbolic Approaches for Chemical Names Recognition</title>
<author>
<name sortKey="Boudin, Florian" sort="Boudin, Florian" uniqKey="Boudin F" first="Florian" last="Boudin">Florian Boudin</name>
</author>
<author>
<name sortKey="Torres Moreno, Manuel" sort="Torres Moreno, Manuel" uniqKey="Torres Moreno M" first="Manuel" last="Torres-Moreno">Manuel Torres-Moreno</name>
</author>
<author>
<name sortKey="El Beze, Marc" sort="El Beze, Marc" uniqKey="El Beze M" first="Marc" last="El-Bèze">Marc El-Bèze</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:8511A82D0D8C02A599403A36D5B8D7894ECE64D3</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/978-3-540-78135-6_28</idno>
<idno type="url">https://api.istex.fr/document/8511A82D0D8C02A599403A36D5B8D7894ECE64D3/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001045</idno>
<idno type="wicri:Area/Istex/Curation">001021</idno>
<idno type="wicri:Area/Istex/Checkpoint">000E66</idno>
<idno type="wicri:doubleKey">0302-9743:2008:Boudin F:mixing:statistical:and</idno>
<idno type="wicri:Area/Main/Merge">001383</idno>
<idno type="wicri:Area/Main/Curation">001380</idno>
<idno type="wicri:Area/Main/Exploration">001380</idno>
<idno type="wicri:Area/France/Extraction">000147</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Mixing Statistical and Symbolic Approaches for Chemical Names Recognition</title>
<author>
<name sortKey="Boudin, Florian" sort="Boudin, Florian" uniqKey="Boudin F" first="Florian" last="Boudin">Florian Boudin</name>
<affiliation wicri:level="1">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire Informatique d’Avignon, 339 chemin des Meinajaries, BP1228, 84911, Avignon, Cedex 9</wicri:regionArea>
<wicri:noRegion>Cedex 9</wicri:noRegion>
<wicri:noRegion>Cedex 9</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Torres Moreno, Manuel" sort="Torres Moreno, Manuel" uniqKey="Torres Moreno M" first="Manuel" last="Torres-Moreno">Manuel Torres-Moreno</name>
<affiliation wicri:level="1">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire Informatique d’Avignon, 339 chemin des Meinajaries, BP1228, 84911, Avignon, Cedex 9</wicri:regionArea>
<wicri:noRegion>Cedex 9</wicri:noRegion>
<wicri:noRegion>Cedex 9</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country xml:lang="fr">Canada</country>
<wicri:regionArea>Département de génie informatique, École Polytechnique de Montréal, CP 6079 Succ. Centre Ville, H3C 3A7, Montréal (Québec)</wicri:regionArea>
<wicri:noRegion>Montréal (Québec)</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="El Beze, Marc" sort="El Beze, Marc" uniqKey="El Beze M" first="Marc" last="El-Bèze">Marc El-Bèze</name>
<affiliation wicri:level="1">
<country xml:lang="fr">France</country>
<wicri:regionArea>Laboratoire Informatique d’Avignon, 339 chemin des Meinajaries, BP1228, 84911, Avignon, Cedex 9</wicri:regionArea>
<wicri:noRegion>Cedex 9</wicri:noRegion>
<wicri:noRegion>Cedex 9</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2008</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">8511A82D0D8C02A599403A36D5B8D7894ECE64D3</idno>
<idno type="DOI">10.1007/978-3-540-78135-6_28</idno>
<idno type="ChapterID">28</idno>
<idno type="ChapterID">Chap28</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: This paper investigates the problem of automatic chemical Term Recognition (TR) and proposes to tackle the problem by fusing Symbolic and statistical techniques. Unlike other solutions described in the literature, which only use complex and costly human made ruled-based matching algorithms, we show that the combination of a seven rules matching algorithm and a naïve Bayes classifier achieves high performances. Through experiments performed on different kind of available Organic Chemistry texts, we show that our hybrid approach is also consistent across different data sets.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Canada</li>
<li>France</li>
</country>
</list>
<tree>
<country name="France">
<noRegion>
<name sortKey="Boudin, Florian" sort="Boudin, Florian" uniqKey="Boudin F" first="Florian" last="Boudin">Florian Boudin</name>
</noRegion>
<name sortKey="Boudin, Florian" sort="Boudin, Florian" uniqKey="Boudin F" first="Florian" last="Boudin">Florian Boudin</name>
<name sortKey="El Beze, Marc" sort="El Beze, Marc" uniqKey="El Beze M" first="Marc" last="El-Bèze">Marc El-Bèze</name>
<name sortKey="El Beze, Marc" sort="El Beze, Marc" uniqKey="El Beze M" first="Marc" last="El-Bèze">Marc El-Bèze</name>
<name sortKey="Torres Moreno, Manuel" sort="Torres Moreno, Manuel" uniqKey="Torres Moreno M" first="Manuel" last="Torres-Moreno">Manuel Torres-Moreno</name>
<name sortKey="Torres Moreno, Manuel" sort="Torres Moreno, Manuel" uniqKey="Torres Moreno M" first="Manuel" last="Torres-Moreno">Manuel Torres-Moreno</name>
</country>
<country name="Canada">
<noRegion>
<name sortKey="Torres Moreno, Manuel" sort="Torres Moreno, Manuel" uniqKey="Torres Moreno M" first="Manuel" last="Torres-Moreno">Manuel Torres-Moreno</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Belgique/explor/OpenAccessBelV2/Data/France/Analysis
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000147 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000147 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Belgique
   |area=    OpenAccessBelV2
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     ISTEX:8511A82D0D8C02A599403A36D5B8D7894ECE64D3
   |texte=   Mixing Statistical and Symbolic Approaches for Chemical Names Recognition
}}

Wicri

This area was generated with Dilib version V0.6.25.
Data generation: Thu Dec 1 00:43:49 2016. Site generation: Wed Mar 6 14:51:30 2024